05/01/06 13:59 FAX 703 305 3988 US Patent Office TC 2730 @004 

WESTMAN CHAMPLIN & KELLY ® 004 



04/28/2006 10j02_F AX 612334331 2 



-2- 



AM^lTOMSNT TO ™E CLAIMS 

». (Currently A-=ded, X vol** procssiag «^ 

^i,.. voice -eaaoe <VM, data ~£ I 

TCSSag e data indicative of » plurality of voice 

messages ; _ 
« distributed voice data process, coupled to the VM data 
store, configured to access the voice messages, extract 
desired information from the voice messages and augment 
Om VM data stored in the VM data store with the 
desired information; and 
a user interface component coupled to the VM data store and 
configured to provide user access to the augmented VM 
data. 

2. (Original) The system of claim 1 wherein the distributed voice 
data processor comprises t 

a rule application component configured to receive user rule 
inputs indicative of user-selected rules and to apply 
the user-selected rules to the augmented VM data. 

3. (Original) The system of claim 2 wherein the distributed 
voice data processor comprises: 

a speaker identification model data store storing at least 
one speaker identification model,- and 

a speaker identification component configured to access the 
speaker identification model data store and provide an 
indication of an identity of a speaker associated with 
the voice message corresponding to the VM data. ( 

5 



4. (Original) The system of claim 3 wherein the distributed 
voice data processor comprises s 
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a opener model training component configured to receive VM 
P data and train a speaker identification model based on 
the VM data and a user input indicative of a speaker of 
a voice message corresponding to the VM data. 

5. (Original) The system of claim 2 wherein the distributed 
voice data processor comprises t 

an acoustic feature extractor extracting acoustxc 

from the VM data, the acoustic features being 

indicative of the desired information. 

6 (currently Amended) The system of claim ^-5_wherein the 
acoustic feature extractor is configured to extract features 
indicative of a speaker emotion and provide an emotion output 
indicative of the speaker's emotion. 

7 (Currently Amended) The system of claim *-5_vherein the | 
acoustic feature extractor is configured to extract features 
indicative of a speaking rats and provide a rate output 
indicative of the speaking rate. 

8. (Original) The system of claim 7 wherein the distributed 
voice data processor comprises: 

a rate normalization component configured to receive the 
rate output and normalize an associated voice message 
to a preselected speaking rate. 

9. (Original) The system of claim 2 wherein the distributed 
voice data processor comprises; . 

a speech-to-text component configured to generate a textual 
output indicative of a content of a voice message. 
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XO (Original) The system of claim 9 wherein the speech- to- te^ 
cogent is confined to generate a transcription of the voxce 
message as the textual output. 

11. (Original) The system of claim 9 wherein the distributed 
voice data processor comprises: 

a summarization component configured to generate a summary 

of the voice message. 

12. (Original) The system of claim 9 wherein the distributed 
voice data processor comprises! 

a semantic parser configured to generate a semantic parse of 
at least a portion of the voice message. 

13 (Original) The system of claim 2 wherein the rule application 
component sorts voice messages based on the desired informatxon. 

14. (Original) The system of claim 2 wherein the rule application 
component generates alarms based on the desired information. 

15 (Original) The system of claim 2 wherein the user interface 
component generates a user interface exposing user-selectable 
inputs for manipulation of the voice message by the user. 



16. (original) The system of claim 15 wherein the user-selectable 

input b cornpari s e = \ 
a rate changing input which, when actuated by a user, 
changes a speaking rate associated with voice messages;. 

I 

17. (Original) The system of claim 15 wherein the user interfac^ 
displays a textual indication of a content of a voice message. . 
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aisplay. an Ktantity indication l-aictive of an xdent it y of a 
speaker of a voice message. 

"19. (Original) The system of e)M? 4* user interface 

displays an emotion indicator indicative of an emotion of a 
speaker o£ a voice message. 

20. (Original) m. system of claim 15 wherein the user interface 
displays a rule indicator indicative of rules being applied.:; %i 

21 (Original) A method of processing voice messages, comprising:: 
storing the voice messages at a distributed voice message 

(VM) data store? 
intermittently accessing the VM data store to' determine 

whether a new voice message has been stored; 
for each new voice message, processing the new voice message 
at a distributed processor to obtain extracted data 
including speaker identity, acoustic features 
indicative of desired information, and a textual 
representation of a content of the new voice me^ge; 
and 

augmenting data in the VM data store with the extracted 
data. 

22 - (Original) The-Vrietfat^ to^^^^^ ^ e new 

I ' voice message to obtain acoustic f ea^^.^p^rj^es s : 

obtaining acoustic features-; .^^i^pT. an emotion of. ,a 
speaker of the • new /^^^m^ generating |a 

speaker emotion^ ^ut^^^^^ive : ?f the ..speaker's 



i emotion. •• ." .• • «*r-i« 

J - . . , • .. . '.'OA 
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teaturea iacluda a rata indicator ix-Ueatlva of a 

faking ™« of the ap.a*ar c£ tha naw voica meaaaga. and 

further comprising: fl ^nVinfl 
normalizing the speaking rate to a user-selected speaking 



rate. 



24. (Original) The- method of claim 21 wherein obtaining speaker 
identity includes providing an unknown output when speaker 
identity is determined to be unknown and further comprising. 

receiving a user input indicative of a speaker identity for 

the new voice message; and 
tra ining a speaker identification model based on the new 
voice message and the user input. 

25 (Original) The method of claim 21 and further comprising: 

receiving a rules input indicative of user-selected rules to 

be applied to the new voice message/ and 
applying the user-selected rules based on the extracted 
data. 

26. (Original) The method of claim 21 and further comprising: 

semantically parsing the textual representation of the new 
voice message . 

27. (Original) The method of claim 21 and further comprising: 
generating a user interface to the VM. data store, the user 
interface including user -act liable inputs for manipulating the J 
voice messages in the VM data store. 
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